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INTERNATIONAL STANDARD © ISO/IEC ISO/IEC CD 6937 


Information technology - Coded graphic character set 
for text communication - Latin alphabet 


1 Scope 


This International Standard 
a) specifies the coded representation of the characters; 


b) specifies a repertoire of the Latin alphabetic and non-alphabetic characters for the communication of text in many 
European languages using the Latin script; 


c) specifies rules for the definitions and use of graphic character subrepertoires, i.e. subsets of the specified 
character repertoire. 


2 Conformance and implementation 


2.1 Conformance 
2.1.1 Conformance of information interchange 


A coded-character-data-element (CC-data-element) within coded information for interchange is in conformance with 
this International Standard if all coded representations of characters within that CC-data-element conform to the 
mandatory requirements of this International Standard. 


A claim of conformance shall identify: 


- the subrepertoire in accordance with clause 9, if one has been adopted, 
- the 7-bit coding in accordance with Annex A, if it has been adopted. 


2.1.2 Conformance of devices 


A device is in conformance with this International Standard if it conforms to the requirements of 2.1.2.1 and either 
or both 2.1.2.2 and 2.1.2.3 below. 


2.1.2.1 Device description 

A device that conforms to this International Standard shall be the subject of a description that identifies the means 
by which the user may supply characters to the device, or may recognize them when they are made available to 
the user, as specified respectively in 2.1.2.2 and 2.1.2.3 below. 


2.1.2.2 Originating devices 


An originating device shall allow its user to supply any sequence of characters of the character repertoire, and shall 
be capable of transmitting their coded representations within a CC-data-element. 


2.1.2.3 Receiving devices 

A receiving device shall be capable of receiving and interpreting any coded representation of characters that are 
within a CC-data-element, and that conform to 2.1.1 of this International Standard, and shall make the 
corresponding characters available to its user in such a way that the user can identify them among those of the 
repertoire, and can distinguish them from each other. 


2.2 Implementation 


The use of this character set requires definitions of its implementation in various media. For example, these could 
include magnetic and optical interchangeable media and transmission channels, thus permitting interchange of data 
to take place either indirectly by means of an intermediate recording on a physical medium, or by local connection 
of various units (Such as input and output devices and computers) or by means of data transmission equipment. 


The implementation of this coded character set in physical media and for transmission, taking into account the need 
for error checking, may be the subject of other International Standards. 


3 Normative references 

The following standards contain provisions which, through reference in this text, constitute provisions of this 
International Standard. At the time of publication, the editions indicated were valid. All Standards are subject to 
revision, and parties to agreements based on this International Standard are encouraged to investigate the 
possibility of applying the most recent editions of the standards indicated below. Members of IEC and ISO maintain 
registers of currently valid International Standards. 

ISO/IEC 2022:1994, Information technology - Character code structure and extension techniques. 


ISO/IEC 7350:1991, Information technology - Registration of repertoires of the graphic characters from 
ISO/IEC 10367. 


ISO/IEC 10367:1991, Information technology - Standardized coded graphic character sets for use in 8-bit 
codes. 


ISO/IEC 10538:1991, Information technology - Control functions for text communication. 


ISO/IEC 10646:1998, Information technology - Universal Multiple-Octet Coded Character Set (UCS) - Part 1: 
Architecture and Basic Multilingual Plane (BMP) (including AMD 1-9 and COR 1). 


4 Definitions 


For the purposes of this International Standard, the following definitions apply: 


4.1 active position: The character position which is to image the graphic symbol representing the next 
graphic character or relative to which the next control function is to be executed. 


4.2 bit combination: An ordered set of bits used for the representation of characters. 
4.3 character: A member of a set of elements used for the organization, control or representation of data. 
4.4 character position: The portion of a display that is imaging or is capable of imaging a graphic symbol. 


4.5 coded-character-data-element (CC-data-element): An element of interchanged information that is 
specified to consist of a sequence of coded representations of characters, in accordance with one or more identified 
standards for coded character sets. 


NOTE 1 In a communication environment in accordance with the Reference Model for Open Systems Interconnec- 


tion of ISO 7498, a CC-data-element will form all or part of the information that corresponds to the Present- 
ation-Protocol-Data-Unit (PPDU) defined in that International Standard. 


NOTE 2 When information interchange is accomplished by means of interchangeable media, a CC-data-element 
will form all or part of the information that corresponds to the user data, and not that recorded during formatting 
and initialization. 


4.6 coded character set; code: A set of unambiguous rules that establishes a character set and the one-to-one 
relationship between the characters of the set and their bit combinations. 


4.7 code extension: The techniques for the encoding of characters that are not included in the character set 
of a given code. 


4.8 code table: A table showing the character allocated to each bit combination in a code. 
4.9 control character: A control function the coded representation of which consists of a single bit combination. 


4.10 control function: An element of a character set that affects the recording, processing, transmission or inter- 
pretation of data, and that has a coded representation consisting of one or more bit combinations. 


4.11 device: A component of information processing equipment which can transmit, and/or receive, coded 
information within CC-data-elements. 


NOTE 3 It may be an input/output device in the conventional sense, or a process such as an application program 
or gateway function. 


4.12 escape sequence: A string of bit combinations that are used for control purposes in code extension 
procedures. The first of these bit combinations represents the control function ESCAPE. 


NOTE 4 Formats and rules regarding the use of escape sequences are specified in ISO/IEC 2022. 


4.13 graphic character: A character, other than a control function, that has a visual representation normally 
handwritten, printed or displayed, and that has a coded representation consisting of one or more bit combinations. 


4.14 graphic symbol: A visual representation of a graphic character or of a control function. 


4.15 repertoire: A specified set of characters that are represented by one or more bit combinations of a coded 
character set. 


4.16 text: A representation of information for human comprehension that is intended for presentation in a 
two-dimensional form, for example printed on paper or displayed on a screen. 


Text consists of symbols, phrases or sentences in natural or artificial languages, pictures, diagrams and tables. 
NOTE 5 This International Standard applies only to text made up of characters. 
4.17 text communication; communication of text: The transfer of text by means of telecommunications. 


NOTE 6 In the context of this International Standard, text communication is by means of binary-coded represen- 
tations of characters. 


4.18 user: A person or other entity that invokes the services provided by a device. 


NOTE 7 This entity may be a process such as an application program if the "device" is a code convertor or a 
gateway function, for example. 


NOTE 8 The characters, as supplied by the user or made available to the user, may be in the form of codes local 
to the device, or of non-conventional visible representations, provided that 2.1.2 above is satisfied. 


5 Notation, code table and names 


5.1 Notation 
The bits of the bit combinations of the 8-bit code are identified by bg, 07, bg, bs, by, bg, b» and b4, where bg is 
the highest-order, or most significant bit and b4 is the lowest-order, or least significant bit. 


The bit combinations may be interpreted to represent numbers in the range 0 to 255 in binary notation by attributing 
the following weights to the individual bits: 


In this International Standard, the bit combinations are identified by notations of the form xx/yy, where xx and yy 
are numbers in the range 00 to 15. The correspondence between the notations of the form xx/yy and the bit 
combinations consisting of the bits bg to b4, is as follows: 


- xx is the number represented by bg, b, bg and bs where these bits are given the weights 8, 4, 2 and 1, 
respectively; 


- yy is the number represented by by, b3, bo and b4 where these bits are given the weights 8, 4, 2 and 1, 
respectively. 


The notations of the form xx/yy are the same as the ones used to identify code table positions, where xx is the 
column number and yy is the row number (see 5.2). 


5.2 Code table 


An 8-bit code table consists of 256 positions arranged in 16 columns and 16 rows. The columns and rows are 
numbered 00 to 15. 


The code table positions are identified by notations of the form xx/yy, where xx is the column number and yy is the 
row number. 


The positions of the code table are in one-to-one correspondence with the bit combinations of the code. The 
notation of a code table position, of the form xx/yy, is the same as that of the corresponding bit combination. 


5.3 Names 


This International Standard assigns one name to each character. In addition, it specifies an acronym for the three 
characters SPACE, NO-BREAK SPACE and SOFT HYPHEN and a graphic symbol for the other graphic characters. 
By convention, only capital letters, space and hyphen are used for writing the names of characters. It is intended 
that the acronym and this convention be retained in all translations of the text of this International Standard. 


The names chosen to denote graphic characters are intended to reflect their customary meaning. However, this 
International Standard does not define and does not restrict the meanings of graphic characters. Neither does it 
specify a particular style or font design for imaging the graphic characters. 


6 Specifications of SPACE, NO-BREAK SPACE and SOFT НҮРНЕМ 


6.1 SPACE (SP): A graphic character that has a visual representation consisting of the absence of a graphic 
symbol. Its coded representation is 02/00. 


6.2 NO-BREAK SPACE (NBSP): A graphic character, the visual representation of which consists of the ab- 
sence of a graphic symbol, for use when a line break is to be prevented in the text as presented. 


6.3 SOFT HYPHEN (SHY): A graphic character that is imaged by a graphic symbol identical with, or similar to, 
that representing HYPHEN-MINUS, for use when a line break has been established within a word. 


7 Composition of the character repertoire 


The repertoire of the graphic characters defined in this International Standard consists of 
a) SPACE (SP) 
and of 332 characters as follows 
b) Latin alphabetic characters comprising 
1) the 52 capital and small letters of the basic Latin alphabet, 


2) accented letters, the graphic representations of which consist of combinations of basic Latin letters 
with diacritical marks, 


3) special alphabetic characters which are neither basic Latin letters nor combinations of basic Latin 
letters with diacritical marks; 


c) non-alphabetic characters, such as digits, fractions, punctuation and diacritical marks, monetary symbols etc. 


The repertoire, excluding SPACE, is specified in table 4. In each table entry, the first column specifies the name 
of the character. The second column specifies its coded representation (see 8.3). 


NOTE 9 A survey of the use of Latin characters in various languages is included in Annex D. 


NOTE 10 Use of the following characters: LATIN CAPITAL LETTER L WITH MIDDLE DOT, LATIN SMALL 
LETTER L WITH MIDDLE DOT and LATIN SMALL LETTER N PRECEDED BY APOSTROPHE, is deprecated. 


8 Specification of the coded character set 
8.1 Character sets 


The coded representations of the graphic characters of the repertoire defined in this International Standard make 
use of the character SPACE and of two character sets, that is "a primary set" and a "supplementary set". 


The primary set shall consist of the graphic characters of the basic GO set of ISO/IEC 10367, represented by bit 
combinations 02/01 to 07/14. The characters of the primary set shall not be used in combination with each other 
to generate graphic characters of the repertoire defined in this International Standard. The primary set contains the 
letters of the basic Latin alphabet, some spacing diacritical marks and a number of non-alphabetic characters. 


The supplementary set contains graphic characters, represented by bit combinations 10/00 to 11/15 and 13/00 to 
15/15, and non-spacing diacritical marks, represented by bit combinations 12/00 to 12/15. The graphic characters 
consist of a number of characters used in addition to those in the primary set. 


A non-spacing diacritical mark shall be used only in combination with certain basic Latin letters, or with SPACE. 
The allowed combinations of non-spacing diacritical marks and letters are the ones needed to represent the 
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accented letters included in table 4. This set of combinations is summarized in Annex С. 


The code table for the primary and the supplementary sets of graphic characters is given in table 1. Shaded 
positions denote bit combinations which shall not be used. 


The names of the characters in the primary set are specified in Table 2. 


The names of the characters and non-spacing diacritical marks of the supplementary set are specified in Table 3. 
In order to stress that non-spacing diacritical marks are not characters, the names given to them are printed in 
lower case italics. 


8.2 Explanations concerning the code table 


8.2.1 Bit combinations 10/04 and 10/06 are reserved for future standardization, and shall not be used. 


8.2.2 The non-spacing diacritical marks of column 12 are used only in combination with certain basic Latin letters, 
or with SPACE (see Annex C). The graphic symbols shown in coloumn 12 represent diacritical marks as separate 
graphic characters. 


8.2.3 Bit combinations 12/00, 12/09 and 12/12 are reserved for possible allocation of additional diacritical marks, 
and shall not be used. 


8.2.4 Bit combinations 13/08 to 13/11 and 14/05 are reserved for future standardization, and shall not be used. 
8.3 Coded representations of the graphic characters of the repertoire 


The coded representations of the graphic characters of the repertoire defined in this International Standard are 
specified in table 4. The formats of the coded representations are as follows: 


a) Accented letters 


Each accented letter is represented by a sequence of bit combinations consisting of the coded 
representation of the relevant non-spacing diacritical mark (an element of the supplementary set), 
followed by the coded representation of the relevant basic Latin letter (an element of the primary 
set). 


b) Diacritical marks as separate graphic characters 


The diacritical marks that are elements of the primary set (GRAVE ACCENT, CIRCUMFLEX ACCENT and 
TILDE) are represented as separate graphic characters by the corresponding single bit combination in the 
range 02/01 to 07/14. 


The other ten of the diacritical marks of column 12 are represented as separate graphic characters by a 
sequence of bit combinations consisting of the coded representation of the relevant non-spacing diacritical 
mark (an element of the supplementary set), followed by the coded representation of the character SPACE, 
i.e. the bit combination 02/00. 


c) All other graphic characters of the repertoire 


Any graphic character of the repertoire, other than an accented letter or a diacritical mark as a 
separate graphic character that is not an element of the primary set, is an element of either the 
primary set or the supplementary set and is represented by the corresponding single bit 
combination in the range 02/01 to 07/14 or 10/00 to 15/15. 


Depending of the code extension techniques used, a bit combination, representing an element of either the primary 
or the supplementary set may have to be preceded by a code extension function invoking the character set 
concerned. 


NOTES Explanations concerning certain letters: 


МОТЕ 11 Accented letter LATIN SMALL LETTER G WITH CEDILLA was named "small g with acute accent" 
in the 1983 edition of this International Standard. For compatibility purposes, the coded representation has been 
kept unchanged. The name has been aligned with that in ISO/IEC 10646-1. The cedilla, upturned, is placed above 
"g" for presentation purposes. 


NOTE 12 There is no LATIN CAPITAL LETTER ETH in this International Standard. There is a letter named 
LATIN CAPITAL LETTER D WITH STROKE which will also serve as the capital form of Icelandic Eth, where this 
International Standard is used. It should be noted that ISO/IEC 10646, ISO/IEC 8859-1 and ISO/IEC 10367 provide 
for a LATIN CAPITAL LETTER ETH as well as a LATIN CAPITAL LETTER D WITH STROKE. 


9 Graphic character subrepertoires 


The purpose of defining character subrepertoires is to facilitate communication with equipment capable of 
presenting text using a limited set of graphic characters at one time. An example of equipment that might make 
use of subrepertoires is a text communication terminal containing an output device that has a changeable printing 
element (physical or other). However, in order to comply with the requirements of this International Standard, such 
a text communication terminal has to be capable of receiving and presenting all graphic characters of the repertoire 
in some manner, possibly using one or more alternative printing elements. 


Subrepertoires are defined in accordance with the following rules: 


a) A subrepertoire shall include the character SPACE, the 26 Latin unaccented small letters and the 26 Latin 
unaccented capital letters. 


b) A subrepertoire shall include the 10 digits. 
c) A subrepertoire shall include the following characters: 


Graphic symbol Name 

' APOSTROPHE 

( LEFT PARENTHESIS 
) RIGHT PARENTHESIS 
Я СОММА 

- HYPHEN-MINUS 
FULL STOP 
SOLIDUS 

COLON 

QUESTION MARK 
PLUS SIGN 

EQUALS SIGN 


— o 


locT o ccc 


d) A subrepertoire may include any other graphic characters of the repertoire defined in this International Standard. 
е) A subrepertoire shall not include any character not defined in this International Standard. 

f) Two or more graphic characters of the repertoire shall not be included as a single character in the subrepertoire. 
The procedure for registration of subrepertoires is specified in ISO/IEC 7350. 


The identifier assigned to a registered subrepertoire is intended to be used as a parameter value of the control 
function IDENTIFY GRAPHIC SUBREPERTOIRE (IGS) which is defined in ISO/IEC 10538. 


10 Identification of options 
10.1 Purpose and context of identification 


CC-data-elements conforming to an option of this International Standard are intended to form all or part of a 
composite unit of coded information that is interchanged between a sender and a recipient. The identification of 
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the options of this International Standard that have been adopted by the originator shall also be available to the 
recipient. The route by which such identification is communicated to the recipient is outside the scope of this 
International Standard. 

However, some standards for interchange of coded information may permit, or require, that the coded 
representation of the identification applicable to the CC-data-elements forms part of the interchanged information. 
This clause specifies a coded representation for the identification of options of this International Standard. Such 
coded representations form all or part of an identifying data element, which may be included in information 
interchange in accordance with the relevant standard. 

10.2 Identification of coding method 


The coding method adopted shall be identified by means of one of the following announcer sequences: 
ESC 02/00 04/10 shall identify 7-bit coding (as in Annex A); 


ESC 02/00 04/11 shall identify 8-bit coding. 
10.3 Identification of primary and supplementary sets 


The escape sequences used to designate the primary and the supplementary sets are: 


ESC 02/08 04/02 : to designate the primary set of the present edition of this 
International Standard as the GO set (ISO-IR 6); 

ESC 02/13 05/02 l to designate the supplementarv set of the present edition of 
this International Standard as the G1 set (ISO-IR 156); 

ESC 02/14 05/02 l to designate the supplementarv set of the present edition of 
this International Standard as the G2 set; 

ESC 02/15 05/02 : to designate the supplementary set of the present edition of 


this International Standard as the G3 set. 


NOTE 13 The escape sequences used to designate the primary and the supplementary sets of ISO 6937/2:1983 
are: 


ESC 02/08 04/00 i to designate the primary set as the GO set (ISO-IR 2); 

ESC 02/09 06/12 : to designate the supplementary set as the G1 set (ISO-IR 
90); 

ESC 02/10 06/12 : to designate the supplementary set as the G2 set; 

ESC 02/11 06/12 : to designate the supplementary set as the G3 set. 


10.4 Identification of subrepertoire 


The subrepertoire adopted shall be identified by the control function IDENTIFY GRAPHIC SUBREPERTOIRE (IGS) 
which is defined in ISO/IEC 10538. Parameter values identifying graphic character subrepertoires are registered 
in accordance with ISO/IEC 7350. 


Table 1 - Primary and supplementary sets of graphic characters and non-spacing diacritical marks for 
text communication 
(coding when represented by bit combinations 02/01 to 07/14 and 10/00 to 15/15 of an 8-bit code) 


00-01 02 03 04 05 06-07 06 09 10 11.42 13.14 15 


и iD шы ыш 
A BERD  MEMMELX 
a HEHEHEHE BEBO 
А — HHHHEHH ныыпыг 


B _ ПЫШ eee 
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Table 2 - Specification of the primary character set in an 8-bit code 


Bit Name 
comb. 


05/00 LATIN CAPITAL LETTER P 


EXCLAMATION MARK LATIN CAPITAL LETTER Q 


02/02 QUOTATION MARK 05/02 LATIN CAPITAL LETTER R 


02/03 NUMBER SIGN 05/03 LATIN CAPITAL LETTER S 


DOLLAR SIGN LATIN CAPITAL LETTER T 


02/05 PERCENT SIGN 05/05 LATIN CAPITAL LETTER U 


LATIN CAPITAL LETTER W 

LEFT SQUARE BRACKET 

02702 
с N 


G ce MEE e) 
GRAVE ACCENT 


DIGIT ONE LATIN SMALL LETTER A 


03/02 DIGIT TWO 06/02 LATIN SMALL LETTER B 


LATIN SMALL LETTER D 
LATIN SMALL LETTER Н 


с у а DNE: 
LATIN SMALL LETTER S 
LATIN SMALL LETTER W 
[Wis | lATNCAPTALIETERO |] [CS 


Table 3 - Specification of the supplementary character set in ап 8-bit code 


Bit Name Bit 
comb. comb. 
10/00 NO-BREAK SPACE 13/00 HORIZONTAL BAR 


UPERSCRIPT ONE 
1003 COPYRIGHT SIGN 

TRADE MARK SIGN 

NOT SIGN 


10/10 LEFT DOUBLE QUOTATION MARK 13/10 The position shall not be used) 


10/11 LEFT-POINTING DOUBLE ANGLE 13/11 (This position shall not be used) 
QUOTATION MARK 


10/12 LEFTWARDS ARROW 13/12 VULGAR FRACTION ONE EIGHTH 


10/13 UPWARDS ARROW 13/13 VULGAR FRACTION THREE EIGHTHS 
10/14 RIGHTWARDS ARROW 13/14 VULGAR FRACTION FIVE EIGHTHS 
10/15 DOWNWARDS ARROW 13/15 VULGAR FRACTION SEVEN EIGHTH 


| 
11700 | DEGREE SIGN 14/00 | OHM SIGN 
11701 f PLUS-MINUS SIGN 14/01 — | LATIN CAPITAL LETTER AE 


11/06 PILCROW SIGN 14/06 LATIN CAPITAL LIGATURE IJ 


11/07 MIDDLE DOT 14/07 LATIN CAPITAL LETTER L WITH MIDDLE DOT 


11/08 LATIN CAPITAL LETTER L WITH STROKE 

mm LATIN CAPITAL LIGATURE ОЕ 

at HT-POINTING DOUBLE ANGLE 14/11 MASCULINE ORDINAL INDICATOR 
BUTTON MARK 


| 1112 | VULGAR FRACTION ONE QUARTER 14/12 LATIN CAPITAL LETTER THORN 
ins VULGAR FRACTION ONE HALF 14/13 LATIN CAPITAL LETTER T WITH STROKE 


11/14 VULGAR FRACTION THREE QUARTER 14/14 LATIN CAPITAL LETTER EN 


11/15 INVERTED QUESTION MARK 14/15 LATIN SMALL LETTER N PRECEDED BY 
APOSTROPHE 


a ed 
12100 LATIN SMALL LETTER КАА 

12106 LATIN SMALL LIGATURE T 

ram LATIN SMALL LETTER SHARP 
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Table 4 - Specification of the repertoire 


oded representation 
U A N 0 


A 02/00 
AMPERSAND 02/06 
APOSTROPHE 02/07 
ASTERISK 02/10 
BREVE 02/00 
BROKEN BAR 

CARON 02/00 
CEDILLA 02/00 
CENT SIGN 

CIRCUMFLEX ACCENT 05/14 
COLON 03/10 
COMMA 02/12 
COMMERCIAL AT 04/00 
COPYRIGHT SIGN 

CURRENCY SIGN 

DEGREE SIGN 

DIAERESIS 

DIGIT EIGHT 

DIGIT FIVE 

DIGIT FOUR 

DIGIT NINE 

DIGIT ONE 

DIGIT SEVEN 

DIGIT SIX 

DIGIT THREE 

DIGIT TWO 

DIGIT ZERO 

DIVISION SIGN 

DOLLAR SIGN 

DOT ABOVE 

DOUBLE ACUTE ACCENT 

DOWNWARDS ARROW 

EQUALS SIGN 

EXCLAMATION MARK 

FEMININE ORDINAL INDICATOR 

FULL STOP 

GRAVE ACCENT 

GREATER-THAN SIGN 

HORIZONTAL BAR 

HYPHEN-MINUS 

INVERTED EXCLAMATION MARK 

INVERTED QUESTION MARK 

LATIN CAPITAL LETTER A 

LATIN CAPITAL LETTER A WITH ACUTE 

LATIN CAPITAL LETTER A WITH BREVE 

LATIN CAPITAL LETTER A WITH CIRCUMFLEX 

LATIN CAPITAL LETTER A WITH DIAERESIS 

LATIN CAPITAL LETTER A WITH GRAVE 

LATIN CAPITAL LETTER A WITH MACRON 

LATIN CAPITAL LETTER A WITH OGONEK 

LATIN CAPITAL LETTER A WITH RING ABOVE 
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Table 4 - (continued) 


oed representation 
N 0 


LATIN CAPITAL LETTER AE' 

LATIN CAPITAL LETTER B 

LATIN CAPITAL LETTER C 

LATIN CAPITAL LETTER С WITH ACUTE 
LATIN CAPITAL LETTER C WITH CARON 
LATIN CAPITAL LETTER С WITH CEDILLA 
LATIN CAPITAL LETTER С WITH CIRCUMFLEX 
LATIN CAPITAL LETTER С WITH DOT ABOVE 
LATIN CAPITAL LETTER D 

LATIN CAPITAL LETTER D WITH CARON 
LATIN CAPITAL LETTER D WITH STROKE 
LATIN CAPITAL LETTER E 

LATIN CAPITAL LETTER E WITH ACUTE 
LATIN CAPITAL LETTER E WITH CARON 
LATIN CAPITAL LETTER E WITH CIRCUMFLEX 
LATIN CAPITAL LETTER E WITH DIAERESIS 
LATIN CAPITAL LETTER E WITH DOT ABOVE 
LATIN CAPITAL LETTER E WITH GRAVE 
LATIN CAPITAL LETTER E WITH MACRON 
LATIN CAPITAL LETTER E WITH OGONEK 
LATIN CAPITAL LETTER ENG 

LATIN CAPITAL LETTER F 

LATIN CAPITAL LETTER G 

LATIN CAPITAL LETTER G WITH BREVE 
LATIN CAPITAL LETTER G WITH CEDILLA 
LATIN CAPITAL LETTER G WITH CIRCUMFLEX 
LATIN CAPITAL LETTER G WITH DOT ABOVE 
LATIN CAPITAL LETTER H 

LATIN CAPITAL LETTER H WITH CIRCUMFLEX 
LATIN CAPITAL LETTER H WITH STROKE 
LATIN CAPITAL LETTER | 

LATIN CAPITAL LETTER | WITH ACUTE 

LATIN CAPITAL LETTER | WITH CIRCUMFLEX 
LATIN CAPITAL LETTER | WITH DIAERESIS 
LATIN CAPITAL LETTER | WITH DOT ABOVE 
LATIN CAPITAL LETTER | WITH GRAVE 

LATIN CAPITAL LETTER | WITH MACRON 
LATIN CAPITAL LETTER | WITH OGONEK 
LATIN CAPITAL LETTER | WITH TILDE 

LATIN CAPITAL LETTER J 

LATIN CAPITAL LETTER J WITH CIRCUMFLEX 
LATIN CAPITAL LETTER K 

LATIN CAPITAL LETTER K WITH CEDILLA 
LATIN CAPITAL LETTER L 

LATIN CAPITAL LETTER L WITH ACUTE 
LATIN CAPITAL LETTER L WITH CARON 
LATIN CAPITAL LETTER L WITH CEDILLA 
LATIN CAPITAL LETTER L WITH MIDDLE DOT 
LATIN CAPITAL LETTER L WITH STROKE 
LATIN CAPITAL LETTER M 


NOTE 1 This letter was named LATIN CAPITAL LIGATURE A E in the 1994 edition of 
this International Standard. The name has been aligned with that in ISO/IEC 10646-1. 
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Table 4 - (continued) 


Coded representation 


LATIN CAPITAL LETTER N WITH ACUTE 
LATIN CAPITAL LETTER N WITH CARON 
LATIN CAPITAL LETTER N WITH CEDILLA 
LATIN CAPITAL LETTER N WITH TILDE 

LATIN CAPITAL LETTER О 

LATIN CAPITAL LETTER O WITH ACUTE 
LATIN CAPITAL LETTER O WITH CIRCUMFLEX 
LATIN CAPITAL LETTER O WITH DIAERESIS 
LATIN CAPITAL LETTER O WITH DOUBLE ACUTE 
LATIN CAPITAL LETTER O WITH GRAVE 
LATIN CAPITAL LETTER O WITH MACRON 
LATIN CAPITAL LETTER O WITH STROKE 
LATIN CAPITAL LETTER О WITH TILDE 

LATIN CAPITAL LETTER P 

LATIN CAPITAL LETTER Q 

LATIN CAPITAL LETTER R 

LATIN CAPITAL LETTER R WITH ACUTE 
LATIN CAPITAL LETTER R WITH CARON 
LATIN CAPITAL LETTER R WITH CEDILLA 
LATIN CAPITAL LETTER S 

LATIN CAPITAL LETTER S WITH ACUTE 

LATIN CAPITAL LETTER S WITH CARON 
LATIN CAPITAL LETTER S WITH CEDILLA 
LATIN CAPITAL LETTER S WITH CIRCUMFLEX 
LATIN CAPITAL LETTER T 

LATIN CAPITAL LETTER T WITH CARON 
LATIN CAPITAL LETTER T WITH CEDILLA 
LATIN CAPITAL LETTER T WITH STROKE 
LATIN CAPITAL LETTER THORN 

LATIN CAPITAL LETTER U 

LATIN CAPITAL LETTER U WITH ACUTE 
LATIN CAPITAL LETTER U WITH BREVE 
LATIN CAPITAL LETTER U WITH CIRCUMFLEX 
LATIN CAPITAL LETTER U WITH DIAERESIS 
LATIN CAPITAL LETTER U WITH DOUBLE ACUTE 
LATIN CAPITAL LETTER U WITH GRAVE 
LATIN CAPITAL LETTER U WITH MACRON 
LATIN CAPITAL LETTER U WITH OGONEK 
LATIN CAPITAL LETTER U WITH RING ABOVE 
LATIN CAPITAL LETTER U WITH TILDE 

LATIN CAPITAL LETTER V 

LATIN CAPITAL LETTER W 

LATIN CAPITAL LETTER W WITH CIRCUMFLEX 
LATIN CAPITAL LETTER X 

LATIN CAPITAL LETTER Y 

LATIN CAPITAL LETTER Y WITH ACUTE 

LATIN CAPITAL LETTER Y WITH CIRCUMFLEX 
LATIN CAPITAL LETTER Y WITH DIAERESIS 
LATIN CAPITAL LETTER Z 
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Table 4 - (continued) 


LATIN CAPITAL LETTER Z WITH ACUTE 
LATIN CAPITAL LETTER Z WITH CARON 
LATIN CAPITAL LETTER Z WITH DOT ABOVE 
LATIN CAPITAL LIGATURE lJ? 

LATIN CAPITAL LIGATURE OE? 

LATIN SMALL LETTER A 

LATIN SMALL LETTER A WITH ACUTE 
LATIN SMALL LETTER A WITH BREVE 
LATIN SMALL LETTER A WITH CIRCUMFLEX 
LATIN SMALL LETTER A WITH DIAERESIS 
LATIN SMALL LETTER A WITH GRAVE 
LATIN SMALL LETTER A WITH MACRON 
LATIN SMALL LETTER A WITH OGONEK 
LATIN SMALL LETTER A WITH RING ABOVE 
LATIN SMALL LETTER A WITH TILDE 

LATIN SMALL LETTER AE? 

LATIN SMALL LETTER B 

LATIN SMALL LETTER C 

LATIN SMALL LETTER C WITH ACUTE 
LATIN SMALL LETTER C WITH CARON 
LATIN SMALL LETTER C WITH CEDILLA 
LATIN SMALL LETTER C WITH CIRCUMFLEX 
LATIN SMALL LETTER C WITH DOT ABOVE 
LATIN SMALL LETTER D 

LATIN SMALL LETTER D WITH CARON 
LATIN SMALL LETTER D WITH STROKE 
LATIN SMALL LETTER DOTLESS | 

LATIN SMALL LETTER E 

LATIN SMALL LETTER E WITH ACUTE 
LATIN SMALL LETTER E WITH CARON 
LATIN SMALL LETTER E WITH CIRCUMFLEX 
LATIN SMALL LETTER E WITH DIAERESIS 
LATIN SMALL LETTER E WITH DOT ABOVE 
LATIN SMALL LETTER E WITH GRAVE 
LATIN SMALL LETTER E WITH MACRON 
LATIN SMALL LETTER E WITH OGONEK 
LATIN SMALL LETTER ENG 

LATIN SMALL LETTER ETH 

LATIN SMALL LETTER F 

LATIN SMALL LETTER G 

LATIN SMALL LETTER G WITH BREVE 


oded representation 
12/02 


NOTE 2 In the Dutch language, LATIN CAPITAL LIGATURE lJ is considered as a letter, 
in the French language LATIN CAPITAL LIGATURE OE is considered a letter. 


NOTES This letter was named LATIN SMALL LIGATURE A E in the 1994 edition of this 
International Standard. The name has been aligned with that in ISO/IEC 10646-1. 
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Table 4 - (continued) 


oded representation 
12/0 


LATIN SMALL LETTER G WITH CEDILLA’ 
LATIN SMALL LETTER G WITH CIRCUMFLEX 
LATIN SMALL LETTER G WITH DOT ABOVE 
LATIN SMALL LETTER H 

LATIN SMALL LETTER H WITH CIRCUMFLEX 
LATIN SMALL LETTER H WITH STROKE 
LATIN SMALL LETTER | 

LATIN SMALL LETTER | WITH ACUTE 

LATIN SMALL LETTER | WITH CIRCUMFLEX 
LATIN SMALL LETTER | WITH DIAERESIS 
LATIN SMALL LETTER | WITH GRAVE 

LATIN SMALL LETTER | WITH MACRON 
LATIN SMALL LETTER | WITH OGONEK 
LATIN SMALL LETTER | WITH TILDE 

LATIN SMALL LETTER J 

LATIN SMALL LETTER J WITH CIRCUMFLEX 
LATIN SMALL LETTER K 

LATIN SMALL LETTER K WITH CEDILLA 
LATIN SMALL LETTER KRA 

LATIN SMALL LETTER L 

LATIN SMALL LETTER L WITH ACUTE 

LATIN SMALL LETTER L WITH CARON 
LATIN SMALL LETTER L WITH CEDILLA 
LATIN SMALL LETTER L WITH MIDDLE DOT 
LATIN SMALL LETTER L WITH STROKE 
LATIN SMALL LETTER M 

LATIN SMALL LETTER N 

LATIN SMALL LETTER N PRECEDED BY APOSTROPHE 
LATIN SMALL LETTER N WITH ACUTE 

LATIN SMALL LETTER N WITH CARON 
LATIN SMALL LETTER N WITH CEDILLA 
LATIN SMALL LETTER N WITH TILDE 

LATIN SMALL LETTER O 

LATIN SMALL LETTER O WITH ACUTE 
LATIN SMALL LETTER O WITH CIRCUMFLEX 
LATIN SMALL LETTER O WITH DIAERESIS 
LATIN SMALL LETTER O WITH DOUBLE ACUTE 
LATIN SMALL LETTER O WITH GRAVE 
LATIN SMALL LETTER O WITH MACRON 
LATIN SMALL LETTER O WITH STROKE 
LATIN SMALL LETTER O WITH TILDE 

LATIN SMALL LETTER P 

LATIN SMALL LETTER Q 


NOTE 4 Accented letter LATIN SMALL LETTER G WITH CEDILLA was named "small g with 
acute accent" in the 1983 edition of this International Standard. For compatibility purposes, the 
coded representation has been kept unchanged. The name has been aligned with that in ISO/IEC 
10646-1. 
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Table 4 - (continued) 


aded representation 


LATIN SMALL LETTER R 

LATIN SMALL LETTER R WITH ACUTE 
LATIN SMALL LETTER R WITH CARON 
LATIN SMALL LETTER R WITH CEDILLA 
LATIN SMALL LETTER S 

LATIN SMALL LETTER S WITH ACUTE 

LATIN SMALL LETTER S WITH CARON 
LATIN SMALL LETTER S WITH CEDILLA 
LATIN SMALL LETTER S WITH CIRCUMFLEX 
LATIN SMALL LETTER SHARP S 

LATIN SMALL LETTER T 

LATIN SMALL LETTER T WITH CARON 
LATIN SMALL LETTER T WITH CEDILLA 
LATIN SMALL LETTER T WITH STROKE 
LATIN SMALL LETTER THORN 

LATIN SMALL LETTER U 

LATIN SMALL LETTER U WITH ACUTE 
LATIN SMALL LETTER U WITH BREVE 
LATIN SMALL LETTER U WITH CIRCUMFLEX 
LATIN SMALL LETTER U WITH DIAERESIS 
LATIN SMALL LETTER U WITH DOUBLE ACUTE 
LATIN SMALL LETTER U WITH GRAVE 
LATIN SMALL LETTER U WITH MACRON 
LATIN SMALL LETTER U WITH OGONEK 
LATIN SMALL LETTER U WITH RING ABOVE 
LATIN SMALL LETTER U WITH TILDE 

LATIN SMALL LETTER V 

LATIN SMALL LETTER W 

LATIN SMALL LETTER W WITH CIRCUMFLEX 
LATIN SMALL LETTER X 

LATIN SMALL LETTER Y 

LATIN SMALL LETTER Y WITH ACUTE 

LATIN SMALL LETTER Y WITH CIRCUMFLEX 
LATIN SMALL LETTER Y WITH DIAERESIS 
LATIN SMALL LETTER Z 

LATIN SMALL LETTER Z WITH ACUTE 

LATIN SMALL LETTER Z WITH CARON 
LATIN SMALL LETTER Z WITH DOT ABOVE 
LATIN SMALL LIGATURE IJ* 

LATIN SMALL LIGATURE OE? 

LEFT CURLY BRACKET 

LEFT DOUBLE QUOTATION MARK 


NOTE 5 In the Dutch language, LATIN SMALL LIGATURE IJ is considered as a letter, and in the 
French language LATIN SMALL LIGATURE OE is considered a letter. 
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Table 4 - (concluded) 


aded representation 


LEFT PARENTHESIS 
LEFT-POINTING DOUBLE ANGLE QUOTATION MARK 
LEFT SINGLE QUOTATION MARK 
LEFT SQUARE BRACKET 
LEFTWARDS ARROW 

LESS-THAN SIGN 

LOW LINE 

MACRON 

MASCULINE ORDINAL INDICATOR 
MICRO SIGN 

MIDDLE DOT 

MULTIPLICATION SIGN 

EIGHTH NOTE 

NO-BREAK SPACE 

NOT SIGN 

NUMBER SIGN 

OGONEK 

OHM SIGN 

PERCENT SIGN 

PILCROW SIGN 

PLUS SIGN 

PLUS-MINUS SIGN 

POUND SIGN 

QUESTION MARK 

QUOTATION MARK 

REGISTERED SIGN 

REVERSE SOLIDUS 

RIGHT CURLY BRACKET 

RIGHT DOUBLE QUOTATION MARK 
RIGHT PARENTHESIS 
RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK 
RIGHT SINGLE QUOTATION MARK 
RIGHT SQUARE BRACKET 
RIGHTWARDS ARROW 

RING ABOVE 

SECTION SIGN 

SEMICOLON 

SOFT HYPHEN 

SOLIDUS 

SPACE 

SUPERSCRIPT ONE 

SUPERSCRIPT THREE 
SUPERSCRIPT TWO 

TILDE 

TRADE MARK SIGN 

UPWARDS ARROW 

VERTICAL LINE 

VULGAR FRACTION FIVE EIGHTHS 
VULGAR FRACTION ONE EIGHTH 
VULGAR FRACTION ONE HALF 
VULGAR FRACTION ONE QUARTER 
VULGAR FRACTION SEVEN EIGHTHS 
VULGAR FRACTION THREE EIGHTHS 
VULGAR FRACTION THREE QUARTERS 
YEN SIGN 
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Аппех А 
(normative) 


7-bit code 


This Annex specifies the 7-bit code for the character sets of this International Standard. 


Notation (see 5.1): The bits of the bit combinations of the 7-bit code are identified by b7, bg: 05, 04, bg, b» and 


b4, where bz is the highest-order, or most significant bit and b4 is the lowest-order, or least significant bit. 


The bit combinations may be interpreted to represent numbers in the range 0 to 127 in binary notation by attributing 
the following weights to the individual bits: 


In this International Standard, the bit combinations are identified by notations of the form xx/yy, where xx is a 
number in the range 00 to 07 and yy a number in the range 00 to 15. The correspondence between the notations 
of the form xx/yy and the bit combinations consisting of the bits bz to b4, is as follows: 


- xx is the number represented by b7, bg and bs where these bits are given the weights 4, 2 and 1, respectively; 


- yy is the number represented by by, bg, bo апа b4 where these bits are given the weights 8, 4, 2 and 1, 
respectively. 


The notations of the form xx/yy are the same as the ones used to identify code table positions, where xx is the 
column number and yy is the row number (see 5.2). 


Code table (see 5.2): A 7-bit code table consists of 128 positions arranged in 8 columns and 16 rows. The 
columns are numbered 00 to 07 and the rows are numbered 00 to 15. 


GO, G1, G2 and G3 sets: In a 7-bit code, the elements of a GO set are represented by bit combinations in the 
range 02/01 to 07/14, and the elements of a G1, G2 or G3 set of graphic characters are also represented by bit 
combinations in the range 02/00 to 07/15 after invocation by the appropriate code extension function in accordance 
with ISO 2022. 
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Table A.1 - Primary set of graphic characters for text communication (coding when represented by bit 
combinations 02/01 to 07/14 of a 7-bit code) 
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Table A.2 - Supplementary set of graphic characters апа non-spacing diacritical marks for text 
communication (coding when represented by bit combinations 02/00 to 07/15 of a 7-bit code) 


ddl ERE 
e ер а 
de i ЖЫЙ ШЫ ШЕ r 
ie БЖЮЙЕЛКЕНУ 
ек пг 7 EN 
Wl —ENEEENENEN 
2 
ШЫ акани NEN 
ШЫ ШШШ ИЛЕШ 
nnn ККИ 
Hes БЕКИ 
Ш КЫН | PP 
B | АШИ БЕ | 
pe 
ДСО Del je 


Аппех В 
(informative) 


Method of definition of short identifiers of this International Standard 


Characters are identified by their names as specified in the repertoire. In certain applications, these names may 
be too long for referencing. To serve this situation, a system of short identifiers is introduced. 


NOTE 14 In the 1983 edition of this International Standard, these short identifiers were called "identifiers", and 
intended to identify characters. This practice is not continued in this International Standard, and is in fact 
deprecated. 


For the purpose of this International Standard, a method has been developed which allows for a short form of 
identification of graphic characters. The method is shown in figure B.1. 


Each short identifier consists of two capital letters and two digits. 


The first letter indicates an alphabet or a character category (in the case of a non-alphabetic graphic character). 
Only L, N and S are used in this Annex, the other capital letters are reserved for future use. 


The second letter indicates a letter of the alphabet or, in the case of a non-alphabetic graphic character, the group 
of characters. 


In the case of an alphabetic character, the first digit indicates the presence of a diacritical mark or a special form, 
and the second digit indicates whether it is a capital or a small letter. The digits have no special meaning when 
the short identifier begins with an N or an S. 


The numbering is used in a consistent manner so that each diacritical mark is always given the same number. 
The numbering principle is shown in figure B.2. 


Table B.1 provides the lists of short identifiers and names for the graphic characters of the repertoire defined in 
this International Standard. 


NOTE 15: The following short identifiers have been changed from the second edition to the third edition: 


old new character 

LA51 LA61 LATIN CAPITAL LETTER AE 

LA52 LA62 LATIN SMALL LETTER AE 

LG11 1641 LATIN CAPITAL LETTER G WITH CEDILLA 
LI51 LI63 LATIN CAPITAL LIGATURE IJ 

LI5S2 LI64 LATIN SMALL LIGATURE lJ 

LO51 1063 LATIN CAPITAL LIGATURE OE 

LO52 LO64 LATIN SMALL LIGATURE OE 


and the catogorv LIGATURE has been removed from the method of definition of short identifiers. 
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А 0 1 


| 

| 

|. For alphabetic characters: 
| odd digit = small letter; 

| even digit = capital letter. 
| 


о Мог S in first position: 
no special meaning. 


| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
|. Fer alphabetic characters: 
| 0 = letter without diacritical mark; 
| 1 to 3 = letter with diacrital mark above it; 
| 4 - letter with diacritical mark below it; 
| 6 = special form. 
Г. N or S in first position: 

no special meaning. 


LL For alphabetic characters: 
A to Z = the respective letter of the Latin alphabet. 


LL IK Nin first position: 
D - digit; 
F « fraction; 
S = subscript or superscript. 


LL If Sin first position: 
A - arithmetic sign; 
C = currency sign; 
D - diacritical mark; 
P = punctuation mark; 
M = other symbol (miscellaneous). 


_________ For all graphic characters: 


L = Latin alphabetic character; 
М = numeric graphic character; 
S = special graphic character. 


Figure B.1 - Method of definition of short identifiers 
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No diacritical mark 
ACUTE ACCENT 
GRAVE ACCENT 
CIRCUMFLEX ACCENT 
DIAERESIS 

TILDE 

CARON 

BREVE 

DOUBLE ACUTE ACCENT 
RING ABOVE 

DOT ABOVE 

MACRON 

CEDILLA 

OGONEK 


Special forms: 

AE, D, Н, L, T WITH STROKE 
DOTLESS I 

O WITH STROKE 

KRA 

ENG 

SHARP S 

ETH (see note 12 in clause 8.3) 
L WITH MIDDLE DOT 

N PRECEDED BY APOSTROPHE 
THORN 

IJ, OE 


Figure B.2 - Numbering principle for alphabetic characters 


Small 


63 


Capital 
02 
12 
14 
16 
18 
20 
22 
24 
26 
28 
30 
32 
42 
44 
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Table B.1 - List of short identifiers for the repertoire in alphabetic order 
of character names 


AMPERSAND 

APOSTROPHE 

ASTERISK 

BREVE 

BROKEN BAR 

CARON 

CEDILLA 

CENT SIGN 

CIRCUMFLEX ACCENT 

COLON 

COMMA 

COMMERCIAL AT 

COPYRIGHT SIGN 

CURRENCY SIGN 

DEGREE SIGN 

DIAERESIS 

DIGIT EIGHT 

DIGIT FIVE 

DIGIT FOUR 

DIGIT NINE 

DIGIT ONE 

DIGIT SEVEN 

DIGIT SIX 

DIGIT THREE 

DIGIT TWO 

DIGIT ZERO 

DIVISION SIGN 

DOLLAR SIGN 

DOT ABOVE 

DOUBLE ACUTE ACCENT 

EIGHTH NOTE 

DOWNWARDS ARROW 

EQUALS SIGN 

EXCLAMATION MARK 

FEMININE ORDINAL INDICATOR 

FULL STOP 

GRAVE ACCENT 

GREATER-THAN SIGN 

HORIZONTAL BAR 

HYPHEN-MINUS 

INVERTED EXCLAMATION MARK 
INVERTED QUESTION MARK 

LATIN CAPITAL LETTER A 

LATIN CAPITAL LETTER A WITH ACUTE 
LATIN CAPITAL LETTER A WITH BREVE 
LATIN CAPITAL LETTER A WITH CIRCUMFLEX 
LATIN CAPITAL LETTER A WITH DIAERESIS 
LATIN CAPITAL LETTER A WITH GRAVE 
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Table B.1 - (continued) 


LATIN CA R A WITH OGONEK 
LATIN CAPITAL LETTER A WITH RING ABOVE 
LATIN CAPITAL LETTER A WITH TILDE 

LATIN CAPITAL LETTER AE 

LATIN CAPITAL LETTER B 

LATIN CAPITAL LETTER C 

LATIN CAPITAL LETTER C WITH ACUTE 
LATIN CAPITAL LETTER C WITH CARON 
LATIN CAPITAL LETTER C WITH CEDILLA 
LATIN CAPITAL LETTER C WITH CIRCUMFLEX 
LATIN CAPITAL LETTER C WITH DOT ABOVE 
LATIN CAPITAL LETTER D 

LATIN CAPITAL LETTER D WITH CARON 
LATIN CAPITAL LETTER D WITH STROKE 
LATIN CAPITAL LETTER E 

LATIN CAPITAL LETTER E WITH ACUTE 
LATIN CAPITAL LETTER E WITH CARON 
LATIN CAPITAL LETTER E WITH CIRCUMFLEX 
LATIN CAPITAL LETTER E WITH DIAERESIS 
LATIN CAPITAL LETTER E WITH DOT ABOVE 
LATIN CAPITAL LETTER E WITH GRAVE 
LATIN CAPITAL LETTER E WITH MACRON 
LATIN CAPITAL LETTER E WITH OGONEK 
LATIN CAPITAL LETTER ENG 

LATIN CAPITAL LETTER F 

LATIN CAPITAL LETTER G 

LATIN CAPITAL LETTER G WITH BREVE 
LATIN CAPITAL LETTER G WITH CEDILLA 
LATIN CAPITAL LETTER G WITH CIRCUMFLEX 
LATIN CAPITAL LETTER G WITH DOT ABOVE 
LATIN CAPITAL LETTER H 

LATIN CAPITAL LETTER H WITH CIRCUMFLEX 
LATIN CAPITAL LETTER H WITH STROKE 
LATIN CAPITAL LETTER | 

LATIN CAPITAL LETTER | WITH ACUTE 

LATIN CAPITAL LETTER | WITH CIRCUMFLEX 
LATIN CAPITAL LETTER | WITH DIAERESIS 
LATIN CAPITAL LETTER | WITH DOT ABOVE 
LATIN CAPITAL LETTER | WITH GRAVE 

LATIN CAPITAL LETTER | WITH MACRON 
LATIN CAPITAL LETTER | WITH OGONEK 
LATIN CAPITAL LETTER | WITH TILDE 

LATIN CAPITAL LETTER J 

LATIN CAPITAL LETTER J WITH CIRCUMFLEX 
LATIN CAPITAL LETTER K 

LATIN CAPITAL LETTER K WITH CEDILLA 
LATIN CAPITAL LETTER L 

LATIN CAPITAL LETTER L WITH ACUTE 


Table B.1 - (continued) 


AD D 


LATIN CAPITAL LETTER L WITH CEDILLA 
LATIN CAPITAL LETTER L WITH MIDDLE DOT 
LATIN CAPITAL LETTER L WITH STROKE 
LATIN CAPITAL LETTER M 

LATIN CAPITAL LETTER N 

LATIN CAPITAL LETTER N WITH ACUTE 
LATIN CAPITAL LETTER N WITH CARON 
LATIN CAPITAL LETTER N WITH CEDILLA 
LATIN CAPITAL LETTER N WITH TILDE 

LATIN CAPITAL LETTER O 

LATIN CAPITAL LETTER O WITH ACUTE 
LATIN CAPITAL LETTER O WITH CIRCUMFLEX 
LATIN CAPITAL LETTER O WITH DIAERESIS 
LATIN CAPITAL LETTER O WITH DOUBLE ACUTE 
LATIN CAPITAL LETTER O WITH GRAVE 
LATIN CAPITAL LETTER O WITH MACRON 
LATIN CAPITAL LETTER O WITH STROKE 
LATIN CAPITAL LETTER O WITH TILDE 

LATIN CAPITAL LETTER P 

LATIN CAPITAL LETTER Q 

LATIN CAPITAL LETTER R 

LATIN CAPITAL LETTER R WITH ACUTE 
LATIN CAPITAL LETTER R WITH CARON 
LATIN CAPITAL LETTER R WITH CEDILLA 
LATIN CAPITAL LETTER S 

LATIN CAPITAL LETTER S WITH ACUTE 

LATIN CAPITAL LETTER S WITH CARON 
LATIN CAPITAL LETTER S WITH CEDILLA 
LATIN CAPITAL LETTER S WITH CIRCUMFLEX 
LATIN CAPITAL LETTER T 

LATIN CAPITAL LETTER T WITH CARON 
LATIN CAPITAL LETTER T WITH CEDILLA 
LATIN CAPITAL LETTER T WITH STROKE 
LATIN CAPITAL LETTER THORN 

LATIN CAPITAL LETTER U 

LATIN CAPITAL LETTER U WITH ACUTE 
LATIN CAPITAL LETTER U WITH BREVE 
LATIN CAPITAL LETTER U WITH CIRCUMFLEX 
LATIN CAPITAL LETTER U WITH DIAERESIS 
LATIN CAPITAL LETTER U WITH DOUBLE ACUTE 
LATIN CAPITAL LETTER U WITH GRAVE 
LATIN CAPITAL LETTER U WITH MACRON 
LATIN CAPITAL LETTER U WITH OGONEK 
LATIN CAPITAL LETTER U WITH RING ABOVE 
LATIN CAPITAL LETTER U WITH TILDE 

LATIN CAPITAL LETTER V 

LATIN CAPITAL LETTER W 
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Table B.1 - (continued) 


LATIN CA RX 

LATIN CAPITAL LETTER Y 

LATIN CAPITAL LETTER Y WITH ACUTE 
LATIN CAPITAL LETTER Y WITH CIRCUMFLEX 
LATIN CAPITAL LETTER Y WITH DIAERESIS 
LATIN CAPITAL LETTER Z 

LATIN CAPITAL LETTER Z WITH ACUTE 
LATIN CAPITAL LETTER Z WITH CARON 
LATIN CAPITAL LETTER Z WITH DOT ABOVE 
LATIN CAPITAL LIGATURE IJ 

LATIN CAPITAL LIGATURE OE 

LATIN SMALL LETTER A 

LATIN SMALL LETTER A WITH ACUTE 

LATIN SMALL LETTER A WITH BREVE 

LATIN SMALL LETTER A WITH CIRCUMFLEX 
LATIN SMALL LETTER A WITH DIAERESIS 
LATIN SMALL LETTER A WITH GRAVE 
LATIN SMALL LETTER A WITH MACRON 
LATIN SMALL LETTER A WITH OGONEK 
LATIN SMALL LETTER A WITH RING ABOVE 
LATIN SMALL LETTER A WITH TILDE 

LATIN SMALL LETTER AE 

LATIN SMALL LETTER B 

LATIN SMALL LETTER C 

LATIN SMALL LETTER C WITH ACUTE 
LATIN SMALL LETTER C WITH CARON 
LATIN SMALL LETTER C WITH CEDILLA 
LATIN SMALL LETTER C WITH CIRCUMFLEX 
LATIN SMALL LETTER C WITH DOT ABOVE 
LATIN SMALL LETTER D 

LATIN SMALL LETTER D WITH CARON 
LATIN SMALL LETTER D WITH STROKE 
LATIN SMALL LETTER DOTLESS | 

LATIN SMALL LETTER E 

LATIN SMALL LETTER E WITH ACUTE 
LATIN SMALL LETTER E WITH CARON 
LATIN SMALL LETTER E WITH CIRCUMFLEX 
LATIN SMALL LETTER E WITH DIAERESIS 
LATIN SMALL LETTER E WITH DOT ABOVE 
LATIN SMALL LETTER E WITH GRAVE 
LATIN SMALL LETTER E WITH MACRON 
LATIN SMALL LETTER E WITH OGONEK 
LATIN SMALL LETTER ENG 

LATIN SMALL LETTER ETH 

LATIN SMALL LETTER F 

LATIN SMALL LETTER G 

LATIN SMALL LETTER G WITH BREVE 
LATIN SMALL LETTER G WITH CEDILLA 


Table B.1 - (continued) 


LATIN SMALL LETTER G WITH DOT ABOVE 
LATIN SMALL LETTER H 

LATIN SMALL LETTER H WITH CIRCUMFLEX 
LATIN SMALL LETTER H WITH STROKE 
LATIN SMALL LETTER | 

LATIN SMALL LETTER | WITH ACUTE 
LATIN SMALL LETTER | WITH CIRCUMFLEX 
LATIN SMALL LETTER | WITH DIAERESIS 
LATIN SMALL LETTER | WITH GRAVE 
LATIN SMALL LETTER | WITH MACRON 
LATIN SMALL LETTER | WITH OGONEK 
LATIN SMALL LETTER | WITH TILDE 

LATIN SMALL LETTER J 

LATIN SMALL LETTER J WITH CIRCUMFLEX 
LATIN SMALL LETTER K 

LATIN SMALL LETTER K WITH CEDILLA 
LATIN SMALL LETTER KRA 

LATIN SMALL LETTER L 

LATIN SMALL LETTER L WITH ACUTE 
LATIN SMALL LETTER L WITH CARON 
LATIN SMALL LETTER L WITH CEDILLA 
LATIN SMALL LETTER L WITH MIDDLE DOT 
LATIN SMALL LETTER L WITH STROKE 
LATIN SMALL LETTER M 

LATIN SMALL LETTER N 


LATIN SMALL LETTER N PRECEDED BY APOSTROPHE 


LATIN SMALL LETTER N WITH ACUTE 
LATIN SMALL LETTER N WITH CARON 
LATIN SMALL LETTER N WITH CEDILLA 
LATIN SMALL LETTER N WITH TILDE 
LATIN SMALL LETTER O 

LATIN SMALL LETTER O WITH ACUTE 
LATIN SMALL LETTER O WITH CIRCUMFLEX 
LATIN SMALL LETTER O WITH DIAERESIS 
LATIN SMALL LETTER O WITH DOUBLE ACUTE 
LATIN SMALL LETTER O WITH GRAVE 
LATIN SMALL LETTER O WITH MACRON 
LATIN SMALL LETTER O WITH STROKE 
LATIN SMALL LETTER O WITH TILDE 
LATIN SMALL LETTER P 

LATIN SMALL LETTER Q 

LATIN SMALL LETTER R 

LATIN SMALL LETTER R WITH ACUTE 
LATIN SMALL LETTER R WITH CARON 
LATIN SMALL LETTER R WITH CEDILLA 
LATIN SMALL LETTER S 

LATIN SMALL LETTER S WITH ACUTE 
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Table B.1 - (continued) 


LATIN SMALL LETTER S WITH CEDILLA 
LATIN SMALL LETTER S WITH CIRCUMFLEX 
LATIN SMALL LETTER SHARP S 

LATIN SMALL LETTER T 

LATIN SMALL LETTER T WITH CARON 
LATIN SMALL LETTER T WITH CEDILLA 
LATIN SMALL LETTER T WITH STROKE 
LATIN SMALL LETTER THORN 

LATIN SMALL LETTER U 

LATIN SMALL LETTER U WITH ACUTE 
LATIN SMALL LETTER U WITH BREVE 
LATIN SMALL LETTER U WITH CIRCUMFLEX 
LATIN SMALL LETTER U WITH DIAERESIS 
LATIN SMALL LETTER U WITH DOUBLE ACUTE 
LATIN SMALL LETTER U WITH GRAVE 
LATIN SMALL LETTER U WITH MACRON 
LATIN SMALL LETTER U WITH OGONEK 
LATIN SMALL LETTER U WITH RING ABOVE 
LATIN SMALL LETTER U WITH TILDE 

LATIN SMALL LETTER V 

LATIN SMALL LETTER W 

LATIN SMALL LETTER W WITH CIRCUMFLEX 
LATIN SMALL LETTER X 

LATIN SMALL LETTER Y 

LATIN SMALL LETTER Y WITH ACUTE 

LATIN SMALL LETTER Y WITH CIRCUMFLEX 
LATIN SMALL LETTER Y WITH DIAERESIS 
LATIN SMALL LETTER Z 

LATIN SMALL LETTER Z WITH ACUTE 

LATIN SMALL LETTER Z WITH CARON 
LATIN SMALL LETTER Z WITH DOT ABOVE 
LATIN SMALL LIGATURE lJ 

LATIN SMALL LIGATURE OE 

LEFT CURLY BRACKET 

LEFT DOUBLE QUOTATION MARK 

LEFT PARENTHESIS 

LEFT-POINTING DOUBLE ANGLE QUOTATION MARK 
LEFT SINGLE QUOTATION MARK 

LEFT SQUARE BRACKET 

LEFTWARDS ARROW 

LESS-THAN SIGN 

LOW LINE 

MACRON 

MASCULINE ORDINAL INDICATOR 

MICRO SIGN 

MIDDLE DOT 


Table B.1 - (concluded) 


NO-BREAK SPACE 

NOT SIGN 

NUMBER SIGN 

OGONEK 

OHM SIGN 

PERCENT SIGN 

PILCROW SIGN 

PLUS SIGN 

PLUS-MINUS SIGN 

POUND SIGN 

QUESTION MARK 

QUOTATION MARK 

REGISTERED SIGN 

REVERSE SOLIDUS 

RIGHT CURLY BRACKET 

RIGHT DOUBLE QUOTATION MARK 
RIGHT PARENTHESIS 
RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK 
RIGHT SINGLE QUOTATION MARK 
RIGHT SQUARE BRACKET 
RIGHTWARDS ARROW 

RING ABOVE 

SECTION SIGN 

SEMICOLON 

SOFT HYPHEN 

SOLIDUS 

SPACE 

SUPERSCRIPT ONE 

SUPERSCRIPT THREE 
SUPERSCRIPT TWO 

TILDE 

TRADE MARK SIGN 

UPWARDS ARROW 

VERTICAL LINE 

VULGAR FRACTION FIVE EIGHTHS 
VULGAR FRACTION ONE EIGHTH 
VULGAR FRACTION ONE HALF 
VULGAR FRACTION ONE QUARTER 
VULGAR FRACTION SEVEN EIGHTHS 
VULGAR FRACTION THREE EIGHTHS 
VULGAR FRACTION THREE QUARTERS 
YEN SIGN 


Аппех С 
(informative) 


Use of non-spacing diacritical marks 


The supplementary set (see tables 1 and 3) contains 13 non-spacing diacritical marks which are used in 
combination with the letters of the basic Latin alphabet in the primary set, and with SPACE, to represent accented 
letters and diacritical marks as separate graphic characters. 


The combinations of non-spacing diacritical marks and basic letters which are defined in this International Standard 
are given in table C.1 which also gives ligatures and other special letters. 


NOTE 16: The term "non-spacing diacritical mark" is used in this International Standard in a metaphorical sense 
only. The "combination" of a non-spacing diacritical mark with a basic letter does not "generate" a new letter, but 
only indicates how a letter from the repertoire of this International Standard is to be coded. 


Table C.1 - Combinations of diacritical marks and basic letters 


BASIC acute | grave | circum | diae | tilde | caron | breve | double | ring | dot macron | cedilla | ogonek | ligature | others 
LETTER flex resis acute | above | above 


__ 
Ca ЕЕЕ К Н — 
ef fe fe fe ёш а 


8 
mi 


© 
© 


aE | 
L| 
ою | 
NE 
| 
(zm 
NM 
w 
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Аппех О 
(informative) 


Use of Latin alphabetic characters in various languages 


Table D.1 summarizes the use of the Latin alphabetic characters defined in this International Standard in 41 
different languages (39 European languages, Afrikaans and Esperanto). 


The 26 basic letters of the Latin alphabet have not been included in the table because they are considered 
indispensable in all languages, even though several languages do not require letters such as q or w for their own 
orthographies. 


Table D.1 is intended to provide justification for the composition of the alphabetic part of the graphic character 
repertoire. It does not attempt to define which characters should, and which ones should not, be used in any 
language. 


NOTE 16 Usage within any country or areas is to some extent dependent on the text, its intended use and 
its form of presentation. Furthermore, it is common in many languages to include "loan words" taken from other 
languages. The requirements for these spécialités have not been shown in this table except where such loan words 
have such long-standing or widespread use that they are now considered to be "naturalized" rather than "foreign" 
words in a particular language. 


NOTE 17 See note 12 page 7. 


NOTE 18 As a result of a spelling reform of Greenlandic in 1973, the following characters are depreciated, 
but still used in personal names: 

LATIN CAPITAL LETTER | WITH TILDE 

LATIN SMALL LETTER | WITH TILDE 

LATIN SMALL LETTER KRA 

LATIN CAPITAL LETTER U WITH TILDE 

LATIN SMALL LETTER U WITH TILDE 


NOTE 19 For spelling the Welsh language correctly, some more letters are in fact required. They are not 
included in the repertoire, but are only identified here: 

LATIN CAPITAL LETTER W WITH ACUTE 

LATIN SMALL LETTER W WITH ACUTE 

LATIN CAPITAL LETTER W WITH GRAVE 

LATIN SMALL LETTER W WITH GRAVE 

LATIN CAPITAL LETTER W WITH DIAERESIS 

LATIN SMALL LETTER W WITH DIAERESIS 

LATIN CAPITAL LETTER Y WITH GRAVE 

LATIN SMALL LETTER Y WITH GRAVE 


33 


з4 


Table D.1 - Use of Latin alphabetic characters 


Character | LLLLL 


Languages 
Afrikaans 
Albanian 
Basque 
Breton 
Catalan 
Croat 
Czech 
Danish 
Dutch 
English 
Esperanto 
Estonian 
Faroese 
Finnish 
French 
Frisian 
Galician 
German 
Greenlandic 
Hungarian 
Icelandic 
Irish 
Italian 
Lapp (Sami) 
Latvian 
Lithuanian 
Maltese 
Norwegian 
Occitan 
Polish 
Portuguese 
Rhaeto-Romanic 
Romanian 
(Scots) Gaelic 
Slovak 
Slovene 
Sorbian 
Spanish 
Swedish 
Turkish 


Welsh 


Character | LLLLL 


Languages 
Afrikaans 
Albanian 
Basque 
Breton 
Catalan 
Croat 
Czech 
Danish 
Dutch 
English 
Esperanto 
Estonian 
Faroese 
Finnish 
French 
Frisian 
Galician 
German 
Greenlandic 
Hungarian 
Icelandic 
Irish 
Italian 
Lapp (Sami) 
Latvian 
Lithuanian 
Maltese 
Norwegian 
Occitan 
Polish 
Portuguese 
Rhaeto-Romanic 
Romanian 
(Scots) Gaelic 
Slovak 
Slovene 
Sorbian 
Spanish 
Swedish 
Turkish 


Welsh 


Table D.1 - (continued) 
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Character | LLLLL 


Languages 
Afrikaans 
Albanian 
Basque 
Breton 
Catalan 
Croat 
Czech 
Danish 
Dutch 
English 
Esperanto 
Estonian 
Faroese 
Finnish 
French 
Frisian 
Galician 
German 
Greenlandic 
Hungarian 
Icelandic 
Irish 
Italian 
Lapp (Sami) 
Latvian 
Lithuanian 
Maltese 
Norwegian 
Occitan 
Polish 
Portuguese 
Rhaeto-Romanic 
Romanian 
(Scots) Gaelic 
Slovak 
Slovene 
Sorbian 
Spanish 
Swedish 
Turkish 


Welsh 


Table D.1 - (concluded) 


Аппех Е 
(informative) 


Alternative coded representation of the repertoire 
with no non-spacing diacritical marks 


The character repertoire of this International Standard can also be represented in an alternative coding which does 
not require the use of the non-spacing diacritical marks. 


This coded representation is a version of ISO/IEC 4873 Level 2 or 3 that uses the following graphic character sets 
from ISO/IEC 10367: 


- the Basic GO set (ISO-IR 6), 
- Latin alphabet No 1 supplementary set (ISO-IR 100) or Latin alphabet No 5 supplementary set (ISO-IR 148), 
- Latin alphabet No 2 supplementary set (ISO-IR 101), 


- Supplementary set for Latin alphabets No 1 or 5, and 2 (ISO-IR 154). 
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Аппех Е 
(informative) 


Bibliography 
[1] ISO/IEC 4873:1991, Information technology - ISO 8-bit code for information interchange - Structure and rules 
for implementation. 


[2] ISO/IEC 6429:1992, Information technology - Control functions for coded character sets. 


[3] ISO 8859-1:1987, Information processing - 8-bit single-byte coded graphic character sets - Part 1: Latin 
alphabet No 1. 


[4] ISO 8859-2:1987, Information processing - 8-bit single-byte coded graphic character sets - Part 2: Latin 
alphabet No 2. 


[5] ISO/IEC 8859-9:1989, Information processing - 8-bit single-byte coded graphic character sets - Part 9: Latin 
alphabet No 5. 


[6] ISO/IEC 8859-10:1993, /nformation technology - 8-bit single-byte coded graphic character sets - Part 9: Latin 
alphabet No 6. 


38 


Аппех С 


(informative) 


Main differences between the 1994 (second) edition of ISO/IEC 6937, and 
the present (third) edition of this International Standard 


1 Annex G of the second edition was replaced with a new text. 


2 The names of LATIN SMALL and CAPITAL LETTER AE were changed from the 1994 
edition (where they were called LIGATURE), to align with ISO/IEC 10646-1. 


3 For the same reason, the name MUSIC NOTE was changed to EIGHTH NOTE, and 
TRADEMARK SIGN was changed to TRADE MARK SIGN. 


4 The following short identifiers were changed (see annex B, NOTE 15): 
old new 


А51 LA61 
LA52 LA62 
LG11 LG41 
LIS1 LI63 
LIS2 LI64 
LO51 LO63 
LO52 LO64 
SM95 SM65 
SM96 SM66 


5 A number of small corrections and clarifications was applied. 
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